1 



SEQUENCE LISTING 



<110> Evans, Ronald M. 
Blumberg, Bruce 

<120> NOVEL STEROID -ACTIVATED NUCLEAR 
RECEPTORS AND USES THEREFOR 

<130> SALK2270-2 

<140> 09/458,366 
<141> 1999-12-09 



<150> 09/005,286 
<151> 1998-01-09 



<160> 39 



<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 2068 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (583) . • . (1884) 

<223> N is selected from A, C, T/U or G 



<400> 1 

ggcacgagga gatctaggtt caaattaatg ttgcccctag tggtaaagga cagagaccct 
cagactgatg aaatgcgctc agaattactt agacaaagcg gatatttgcc actctcttcc 
ccttttcctg tgtttttgta gtgaagagac ctgaaagaaa aaagtaggga gaacataatg 
agaacaaata cggtaatctc ttcatttgct agttcaagtg ctggacttgg gacttaggag 
gggcaatgga gccgcttagt gcctacatct gacttggact gaaatatagg tgagagacaa 
gattgtctca tatccgggga aatcataacc tatgactagg acgggaagag gaagcactgc 
ctttacttca gtgggaatct cggcctcagc ctgcaagcca agtgttcaca gtgagaaaag 
caagagaata agctaatact cctgtcctga acaaggcagc ggctccttgg taaagctact 
ccttgatcga tcctttgcac cggattgttc aaagtggacc ccaggggaga agtcggagca 
aagaacttac caccaagcag tccaagaggc ccagaagcaa ac ctg gag gtg aga 

Leu Glu Val Arg 



1 



ccc aaa gaa age tgg aac cat get gac ttt gta cac tgt gag gac aca 
Pro Lys Glu Ser Trp Asn His Ala Asp Phe Val His Cys Glu Asp Thr 



5 10 



15 20 



gag tct gtt cct gga aag ccc agt gtc aac gca gat gag gaa gtc gga 
Glu Ser Val Pro Gly Lys Pro Ser Val Asn Ala Asp Glu Glu Val Gly 

30 35 



25 



ggt ccc caa ate tgc cgt gta tgt ggg gac aag gec act ggc tat cac 
Gly Pro Gin He Cys Arg Val Cys Gly Asp Lys Ala Thr Gly Tyr His 
40 45 50 

ttc aat gtc atg aca tgt gaa gga tgc aag ggc ttt ttc agg agg gec 
Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ala 



60 
120 
180 
240 
300 
360 
420 
480 
540 
594 



642 



690 



738 



786 



55 60 65 

atg aaa cgc aac gcc egg ctg agg tgc ccc ttc egg aag ggc gee tgc 834 
Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg Lys Gly Ala Cys 
70 75 80 

gag ate ace egg aag acc egg cga cag tgc cag gcc tgc cgc ctg cgc 882 
Glu lie Thr Arg Lys Thr Arg Arg Gin Cys Gin Ala Cys Arg Leu Arg 
85 90 95 100 

aag tgc ctg gag age ggc atg aag aag gag atg ate atg tec gac gag 
Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met lie Met Ser Asp Glu 
105 110 115 

gcc gtg gag gag agg egg gcc ttg ate aag egg aag aaa agt gaa egg 97 8 

Ala Val Glu Glu Arg Arg Ala Leu lie Lys Arg Lys Lys Ser Glu Arg 
120 125 130 

aca ggg act cag cca ctg gga gtg cag ggg ctg aca gag gag cag egg 1026 
Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr Glu Glu Gin Arg 
135 140 145 



atg atg ate agg gag ctg atg gac get cag atg aaa acc ttt gac act 
Met Met lie Arg Glu Leu Met Asp Ala Gin Met Lys Thr Phe Asp Thr 
150 155 160 

acc ttc tec cat ttc aag aat ttc egg ctg cca ggg gtg ctt age agt 
Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly Val Leu Ser Ser 
165 170 175 180 



930 



1074 



1122 



ggc tgc gag ttg cca gag cct ctg cag gcc cca teg agg gaa gaa get 1170 
Gly Cys Glu Leu Pro Glu Pro Leu Gin Ala Pro Ser Arg Glu Glu Ala 
185 190 195 

gcc aag tgg age cag gtc egg aaa gat ctg tgc tct ttg aag gtc tct 1218 
Ala Lys Trp Ser Gin Val Arg Lys Asp Leu Cys Ser Leu Lys Val Ser 
200 205 210 

ctg caa get gcg ggg gga gga tgg cag tgt ctg gaa eta caa acn ccc 1266 
Leu Gin Ala Ala Gly Gly Gly Trp Gin Cys Leu Glu Leu Gin Xaa Pro 
215 220 225 

age cga cag tgg egg aaa gag ate ttc tec ctg ctg ccc cac atg get 1314 
Ser Arg Gin Trp Arg Lys Glu lie Phe Ser Leu Leu Pro His Met Ala 
230 235 240 

gac atg tea acc tac atg ttc aaa ggc ate ate age ttt gcc aaa gtc 1362 
Asp Met Ser Thr Tyr Met Phe Lys Gly He He Ser Phe Ala Lys Val 
245 250 255 260 

ate tec tac ttc agg gac ttg ccc ate gag gac cag ate tec ctg ctg 1410 
He Ser Tyr Phe Arg Asp Leu Pro He Glu Asp Gin He Ser Leu Leu 
265 270 275 

aag ggg gcc get ttc gag ctg tgt caa ctg aga ttc aac aca gtg ttc 1458 
Lys Gly Ala Ala Phe Glu Leu Cys Gin Leu Arg Phe Asn Thr Val Phe 
280 285 290 



aac gcg gag act gga acc tgg gag tgt ggc egg ctg tec tac tgc ttg 
Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu Ser Tyr Cys Leu 



1506 



295 



300 



3 



305 



gaa 
Glu 


gac 
Asp 
310 


act 
Thr 


gca 
Ala 


ggt 
Gly 


ggc 
Gly 


ttc 
Phe 
315 


cag 
Gin 


caa 
Gin 


ctt 
Leu 


eta 
Leu 


ctg 
Leu 
320 


gag 
Glu 


ccc 
Pro 


atg 
Met 


ctg 
Leu 


aaa 
Lys 
325 


ttc 
Phe 


cac 
His 


tac 
Tyr 


atg 
Met 


ctg 
Leu 
330 


aag 
Lys 


aag 
Lys 


ctg 
Leu 


cag 
Gin 


ctg 
Leu 
335 


cat 
His 


gag 
Glu 


gag 

Glu 


gag 
Glu 


tat 
Tyr 
340 


crtq 
Val 


ctg 
Leu 


atg 
Met 


cag 
Gin 


gec 
Ala 
345 


ate 
He 


tec 
Ser 


etc 
Leu 


ttc 
Phe 


tec 
Ser 
350 


cca 
Pro 


gac 
Asp 


cgc 
Arg 


cca 
Pro 


ggt 
Gly 
355 


gtg 
Val 


ctg 
Leu 


cag 
Gin 


cac 
His 


cgc 
Arg 
360 


gtg 

Val 


gtg 

Val 


gac 
Asp 


cag 
Gin 


ctg 
Leu 
365 


cag 
Gin 


gag 
Glu 


caa 
Gin 


ttc 
Phe 


gee 
Ala 
370 


att 
He 


act 
Thr 


ctg 
Leu 


aag 
Lys 


tec 
Ser 
375 


tac 
Tyr 


att 
lie 


gaa 
Glu 


tgc 
Cys 


aat 
Asn 
380 


egg 
Arg 


ccc 
Pro 


cag 
Gin 


cct 
Pro 


get 
Ala 
385 


cat 
Hie 


agg 
Arg 


ttc 
Phe 


ttg 
Leu 


ttc 
Phe 
390 


ctg 
Leu 


aag 
Lys 


ate 
He 


atg 
Met 


get 
Ala 
395 


atg 
Met 


etc 
Leu 


acc 
Thr 


gag 
Glu 


etc 
Leu 
400 


cgc 
Arg 


age 
Ser 


ate 
He 


aat 
Asn 


get 
Ala 
405 


cag 
Gin 


cac 
His 


acc 
Thr 


cag 
Gin 


egg 
Arg 
410 


ctg 
Leu 


ctg 
Leu 


cgc 
Arg 


ate 
He 


cag 
Gin 
415 


gac 

Asp 


ata 
He 


cac 
His 


ccc 
Pro 


ttt 
Phe 
420 


get 
Ala 


acg 
Thr 


ccc 
Pro 


etc 
Leu 


atg 
Met 


cag 
Gin 


gag 
Glu 


ttg 
Leu 


ttc 
Phe 


ggc 
Gly 


ate 
He 


aca 
Thr 


ggt 
Gly 


age 
Ser 







425 430 



1554 



1602 



1650 



1698 



1746 



1794 



1842 



1884 



tgagcggctg ccttgggtga caccttcgag aggcagecag acccagagcc ctctgagccg 1944 
gcactcccgg gecaagacag atggacactg ccaagagccg acaatgccct gctggcctgt 2004 
ctccctaggg aattcctget atgacagctg gctagcattc ctcaggaagg acatggggtg 2064 

2068 



cccc 



<210> 2 
<211> 434 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Xaa is threonine 
<400> 2 



Leu 


Glu Val 


Arg 


Pro 


Lys 


Glu 


Ser 


Trp 


Asn 


His 


Ala 


Asp 


Phe 


Val 


His 


1 




5 


s£ 








10 










15 




Cys 


Glu Asp 


Thr 


Glu 


Val 


Pro 


Gly 


Lys 


Pro 


Ser 


Val 


Asn 


Ala 


Asp 


20 










25 










30 






Glu 


Glu Val 


Gly 


Gly 


Pro 


Gin 


He 


Cys 


Arg 


Val 


Cys 


Gly Asp 


Lys 


Ala 




35 








40 










45 






Phe 


Thr 


Gly Tyr 
50 


His 


Phe 


Asn 


Val 
55 


Met 


Thr 


Cys 


Glu 


Gly 
60 


Cys 


Lys 


Gly 


Phe 


Arg Arg 


Ala 


Met 


Lys 


Arg 


Asn 


Ala 


Arg 


Leu 


Arg 


Cys 


Pro 


Phe 


Arg 


65 






70 










75 










80 


Lys 


Gly Ala Cys 


Glu 


He 


Thr 


Arg 


Lys 


Thr 


Arg 


Arg 


Gin 


Cys 


Gin 


Ala 






85 










90 










95 





4 

Cys Arq Leu Arg Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met He 

100 105 HO 

Met Ser Asp Glu Ala Val Glu Glu Arg Arg Ala Leu He Lys Arg Lys 

115 120 125 

Lys Ser Glu Arg Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr 

130 135 140 

Glu Glu Gin Arg Met Met He Arg Glu Leu Met Asp Ala Gin Met Lys 
145 150 155 160 

Thr Phe Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly 

165 170 175 

Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Pro Leu Gin Ala Pro Ser 

180 185 190 

Arg Glu Glu Ala Ala Lys Trp Ser Gin Val Arg Lys Asp Leu Cys Ser 

195 200 205 

Leu Lys Val Ser Leu Gin Ala Ala Gly Gly Gly Trp Gin Cys Leu Glu 

210 215 220 

Leu Gin Xaa Pro Ser Arg Gin Trp Arg Lys Glu He Phe Ser Leu Leu 
225 230 235 240 

Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys Gly He He Ser 

245 250 255 

Phe Ala Lys Val He Ser Tyr Phe Arg Asp Leu Pro He Glu Asp Gin 

260 265 270 

He Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys Gin Leu Arg Phe 

275 280 285 

Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu 

290 295 300 

Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gin Gin Leu Leu Leu 
305 310 315 320 

Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys Leu Gin Leu His 

325 330 335 

Glu Glu Glu Tyr Val Leu Met Gin Ala He Ser Leu Phe Ser Pro Asp 

340 345 350 

Arg Pro Gly Val Leu Gin His Arg Val Val Asp Gin Leu Gin Glu Gin 

355 360 365 

Phe Ala He Thr Leu Lys Ser Tyr He Glu Cys Asn Arg Pro Gin Pro 

370 375 380 

Ala His Arg Phe Leu Phe Leu Lys He Met Ala Met Leu Thr Glu Leu 
385 390 395 400 

Arg Ser He Asn Ala Gin His Thr Gin Arg Leu Leu Arg He Gin Asp 

405 410 415 

He His Pro Phe Ala Thr Pro Leu Met Gin Glu Leu Phe Gly He Thr 
420 425 430 

Gly Ser 



<210> 3 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rCYP3Al 

<400> 3 

tagacagttc atgaagttca tctac 



<210> 4 
<211> 25 
<212> DNA 



<213> Artificial Sequence 



5 



<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rCYP3 A2 

<400> 4 25 
taagcagttc ataaagttca tctac 

<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase , rUGT!A6 

<400> 5 

actgtagttc ataaagttca catgg 

<210> 6 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rbCYP2Cl 

<400> 6 26 
caatcagttc aacagggttc accaat 

<210> 7 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rP450R 

<400> 7 33 
cacaggtgag ctgaggccag cagcaggtcg aaa 

<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rCYP2Al 

<400> 8 27 
gtgcaggttc aactggaggt caacatg 

<210> 9 
<211> 27 
<212> DNA 



<213> Artificial Sequence 



6 



<220> 

<223> putative SXR response element from the steroid 
hydroxylase , rCYP2A2 

<400> 9 

gtgctggttc aactggaggt cagtatg 

<210> 10 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, rCYP2C6 

<400> 10 

agtctagttc agtgggggtt cagtctt 

<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> putative SXR response element from the steroid 
hydoxylase, hCYP2El 

<400> 11 

gagatggttc aaggaagggt cattaac 

<210> 12 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 0 nucleotides 
<400> 12 

catagtcagg tcaaggtcag atcaac 

<210> 13 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 1 nucleotides 
<400> 13 

catagtcagg tcataggtca gatcaac 

<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> direct repeat with spacer of 2 
<400> 14 

catagtcagg tcaataggtc agatcaac 

<210> 15 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 3 
<400> 15 

catagtcagg tcatataggt cagatcaac 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 4 
<400> 16 

catagtcagg tcatataagg tcagatcaac 

<210> 17 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 5 
<400> 17 

catagtcagg tcatatatag gtcagatcaa c 

<210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 6 
<400> 18 

catagtcagg tcatatataa ggtcaagatc aac 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 7 



7 

nucleotides 

28 

nucleotides 

29 

nucleotides 

30 

nucleotides 

31 

nucleotides 

33 

nucleotides 



<400> 19 

catagtcagg tcatatatat aggtcagatc aac 



33 



8 



<210> 20 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 10 nucleotides 
<400> 20 

catagtcagg tcatatatat ataaggtcag atcaac 

<210> 21 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> direct repeat with spacer of 15 nucleotides 
<400> 21 

catagtcagg tcatagtagt agtagtagag gtcagatcaa c 

<210> 22 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> repeat^ unit 
<222> (7) . . . (7) 

<223> N is a nucleotide spacer of 3, 4 or 5 nucleotides, 
wherein each N is independently selected from A, 
T, C or G 

<223> example of a response element suitable for 
practice of the invention method 

<400> 22 
agttcantga act 

<210> 23 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> repeat_unit 
<222> (7) . . . (7) 

<223> N is a nucleotide spacer of 6 nucleotides, wherein 
each N is independently selected from A, T, C or G 

<223> example of a response element suitable for 
practice of the invention method 

<400> 23 
tgaactnagg tea 

<210> 24 
<211> 18 



9 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat with 6 nucleotide 
CYP3A4 

<400> 24 

tgaactcaaa ggaggtca 

<210> 25 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> * 

<223> inverted repeat response element 
nucleotides 

<400> 25 

agcttaggtc atgaccta 

<210> 26 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element 
nucleotides 

<400> 26 

agcttaggtc agtgaccta 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element 
nucleotides 

<400> 27 

agcttaggtc acgtgaccta 

<210> 28 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element 
nucleotides 

<400> 28 

agcttaggtc acagtgacct a 



spacer found in 




with spacer of 0 



18 



with spacer of 1 



19 



with spacer of 2 



20 



with spacer of 3 



<210> 29 
<211> 22 



<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> inverted repeat response element with spacer of 4 
nucleotides 

<400> 29 

agcttaggtc acatgtgacc ta 

<210> 30 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 5 
nucleotides 

<400> 30 

agcttaggtc acactgtgac eta 

<210> 31 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> inverted repeat response element with spacer of 6 
nucleotides 

<400> 31 

agctttgaac tcaaaggagg tea 

<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> IR-M 
<400> 32 

agettaegtc atgacgta 

<210> 33 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CYP3A oligonucleotide, CYP3A4, tested for binding 
<400> 33 

tagaatatga actcaaagga ggtcagtgag tgg 

<210> 34 

<211> 33 

<212> DNA 

<213> Artificial Sequence 



11 



<220> 

<223> CYP3A oligonucleotide, CYP3A5 , tested for binding 
<400> 34 

tagaatatga actcaaagga ggtaagcaaa ggg 

<210> 35 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> CYP3A oligonucleotide, CYP3A7 , tested for binding 



<400> 35 

tagaatatta actcaatgga ggcagtgagt gg 

<210> 36 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



32 



<220> 

<223> oligonucleotide for PCR 
<400> 36 

gagcaattcg ccattactct gaagt 

<210> 37 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 37 

gtccttgggg tcttctacct ttctc 

<210> 38 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for PCR 
<400> 38 

gacgatttgg atctggacat gttgg 

<210> 39 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> oligonucleotide for PCR 
<400> 39 



gttttcatct gagcgtccat cagct 



